pocl: A Performance-Portable OpenCL Implementation
Related resources
A High-Performance, Portable Implementation of the MPI
MPI (Message Passing Interface) is a specification for a standard library for message passing that was defined by the MPI Forum, a broadly based group of parallel computer vendors, library writers, and applications specialists. Multiple implementations of MPI have been developed. In this paper, we describe MPICH, unique among existing implementations in its design goal of combining portability wi...
A High Performance, Portable Distributed BLAS Implementation
In this paper, we give a report on recent developments for the Distributed BLAS (DBLAS) project. These include a powerful distributed matrix representation which yields a simple interface to the DBLAS, and the redesign of the DBLAS algorithms in terms of powerful 'spread' and 'reduce' matrix communication operations for reasons of programmability. The DBLAS codes achieve portability by supporting BLACS ...
A portable OpenCL implementation of generic particle-mesh and mesh-particle interpolation in 2D and 3D
Hybrid particle–mesh methods provide a versatile framework for simulating discrete and continuous systems. A key component is the forward and backward interpolation of particle data to mesh nodes. These interpolations typically account for a significant portion of the computational cost of a simulation. Due to its regular compute structure, interpolation admits SIMD parallelism, and several GPU...
From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applications for GPGPU. While the Khronos Group developed OpenCL with programming portability in mind, performance is not necessarily portable. OpenCL has required performance-impacting initializations that do not exist in other languages such as CUDA. Understanding these implications allows us to provide ...
Journal
Journal title: International Journal of Parallel Programming
Year: 2014
ISSN: 0885-7458,1573-7640
DOI: 10.1007/s10766-014-0320-y